Universal Access To All Knowledge
Home Wayback Machine | Archive-It | Blog | Heritrix
Search: Advanced Search
Anonymous User (login or join us) Upload

Most Downloaded Items
Last Week more

  1. Webwide Crawldata 2011-03-09T00:28:53PST to 2011-03-11T13:42:25PST
    13 downloads
  2. Webwide Crawldata 2011-03-13T18:04:52PDT to 2011-03-13T18:17:45PDT
    10 downloads
  3. Webwide Crawldata 2011-03-20T00:45:19PDT to 2011-03-19T23:59:45PDT
    10 downloads
  4. Webwide Crawldata 2011-04-10T23:07:53PDT to 2011-04-13T17:04:46PDT
    10 downloads
  5. Webwide Crawldata 2011-04-14T13:58:15PDT to 2011-04-14T10:28:36PDT
    10 downloads

Most Downloaded Items more

  1. Webwide Crawldata 2011-05-01T03:04:05PDT to 2011-05-01T00:35:28PDT
    293 downloads
  2. Webwide Crawldata 2011-03-09T00:28:53PST to 2011-03-11T13:42:25PST
    284 downloads
  3. Webwide Crawldata 2011-03-11T21:42:25PST to 2011-03-12T01:22:38PST
    284 downloads
  4. Webwide Crawldata 2011-03-09T03:54:35PST to 2011-03-11T13:42:09PST
    280 downloads
  5. Webwide Crawldata 2011-03-13T06:43:18PDT to 2011-03-13T07:55:18PDT
    280 downloads

Spotlight Item

Webwide Crawldata 2011-05-01T03:04:05PDT to 2011-05-01T00:35:28PDT
Internet Archive crawldata from Webwide Crawl, captured by crawl415.us.archive.org:wide from Sun May 1 03:04:05 PDT 2011 to Sun May 1 00:35:28 PDT 2011.

About the Internet Archive

Background

Frequently Asked Questions

85,614 itemsWelcome to Wide Crawls

Wide crawls of the Internet conducted by Internet Archive. Access to content is restricted. Please visit the Wayback Machine to explore archived web sites.

Browse by Subject / Keywords

All items (most recently added first) - RSS

Sub-Collections

Wide Crawl started April 2012
Web wide crawl with initial seedlist and crawler configuration from April 2012.
14,691 items
Wide Crawl started January 2012
Web wide crawl with initial seedlist and crawler configuration from January 2012 using HQ software.
32,276 items
Wide Crawl started March 2011
Web wide crawl with initial seedlist and crawler configuration from March 2011. This uses the new HQ software for distributed crawling by Kenji Nagahashi.
8,833 items
Wide Crawl started October 2010
Web wide crawl with initial seedlist and crawler configuration from October 2010
16,348 items
Wide Crawl started October 2011
Web wide crawl with initial seedlist and crawler configuration from March 2011 using HQ software.
13,121 items
Wide Crawl started September 2010
Web wide crawl with initial seedlist and crawler configuration from September 2010
339 items

Recently Reviewed Items (more)

This Just In (more)

Webwide Crawldata 2012-06-03T05:34:52PDT to 2012-06-03T00:23:25PDT
6 minutes ago

Webwide Crawldata 2012-06-03T06:04:48PDT to 2012-06-03T00:39:51PDT
9 minutes ago

Webwide Crawldata 2012-06-03T04:46:23PDT to 2012-06-03T00:08:16PDT
15 minutes ago

Webwide Crawldata 2012-06-03T05:20:08PDT to 2012-06-03T00:21:10PDT
23 minutes ago

Webwide Crawldata 2012-06-03T04:26:41PDT to 2012-06-02T23:53:53PDT
34 minutes ago


  

Terms of Use (10 Mar 2001)